A new class of enzymes in the biosynthetic pathway for the production of
triacylglycerol and recombinant DNA molecules encoding these enzymes

Applicants and inventors: 

Anders Dahlqvist, Ulf Stahl, Marit Lenman, Antoni Banas, Hans Ronne and
Sten Stymne 

Description 

The present invention relates to the isolation, identification and 
characterization of recombinant DNA molecules encoding enzymes catalysing 
the transfer of fatty acids from phospholipids to diacylglycerol in the 
biosynthetic pathway for the production of triacylglycerol.

The invention further relates to a novel type of enzyme, the genes encoding
such enzymes and the use of these genes for transformation. More
specifically, the invention relates to the use of a previously undescribed
type of enzyme, hereinafter designated phospholipid:diacylglycerol
acyltransferases (PDAT), and the genes encoding them. Genes of this type,
expressed alone in transgenic organisms, will enhance the total amount of
oil (triacylglycerols) produced in the cells. The PDAT genes, in
combination with a gene for the synthesis of an uncommon fatty acid, will,
when expressed in transgenic organisms, enhance the levels of the uncommon
fatty acids in the triacylglycerols.

There is considerable interest world-wide in producing chemical feedstock, 
such as fatty acids, for industrial use from renewable plant resources 
rather than non-renewable petrochemicals. This concept has broad appeal to 
manufacturers and consumers on the basis of resource conservation and 
provides significant opportunity to develop new industrial crops for 
agriculture.

There is a diverse array of unusual fatty acids in oils from wild plant 
species and these have been well characterised (see e.g. Badami & Patil, 
1981). Many of these acids have industrial potential and this has led to
interest in domesticating relevant plant species to enable agricultural 
production of particular fatty acids. 

Developments in genetic engineering technologies combined with greater
understanding of the biosynthesis of unusual fatty acids now make it
possible to transfer genes coding for key enzymes involved in the synthesis 
of a particular fatty acid from a wild species into domesticated oilseed 
crops. In this way individual fatty acids can be produced in high purity 
and quantities at moderate costs. 

In all crops like rape, sunflower, oil palm etc., the oil (i.e.
triacylglycerols) is the most valuable product of the seeds or fruits, and
other compounds like starch, protein and fibre are regarded as by-products
of less value. Enhancing the quantity of oil on a weight basis at the
expense of other compounds in oil crops would therefore increase the value
of the crop. If genes regulating the allocation of reduced carbon into the
production of oil can be up-regulated, the cells will accumulate more oil
at the expense of other products. Such genes might not only be used in
cells that already produce large amounts of oil, such as those of oil
crops, but could also induce significant oil production in crops with
moderate or low oil content, e.g. soy, oat, maize, potato, sugar beet and
turnip, as well as in micro-organisms.

Summary of the invention 

Many of the uncommon fatty acids of interest, e.g. medium chain fatty
acids, hydroxy fatty acids, epoxy fatty acids and acetylenic fatty acids,
have physical properties that are distinctly different from those of the
common plant fatty acids. The present inventors have found that, in plant
species naturally accumulating these uncommon fatty acids in their seed oil
(i.e. triacylglycerol), these acids are absent, or present in very low
amounts, in the membrane (phospho)lipids of the seed. The low concentration
of these acids in the membrane lipids is most likely a prerequisite for
proper membrane function and thereby for proper cell function. One aspect
of the invention is that seeds of transgenic crops can be made to
accumulate high amounts of uncommon fatty acids if these fatty acids are
efficiently removed from the membrane lipids and channelled into seed
triacylglycerols.

The inventors have identified a novel class of enzymes in plants that
catalyse the transfer of fatty acids from phospholipids to diacylglycerol
in the production of triacylglycerol and, presumably, lysophospholipids,
and have found that these enzymes (phospholipid:diacylglycerol
acyltransferases, abbreviated as PDAT) are involved in the removal of
hydroxylated and epoxygenated fatty acids, and probably also other uncommon
fatty acids such as medium chain fatty acids, from phospholipids in plants.
Further, the same enzyme reaction was shown to be present in microsomal
preparations from baker's yeast (Saccharomyces cerevisiae). A so-called
"knock-out" yeast mutant, disrupted in the respective gene, was obtained,
and microsomal membranes from the mutant were shown to totally lack PDAT
activity. Thus, it was proved that the disrupted gene encodes a PDAT enzyme
(SEQ ID NO. 1 and 2). In addition, a gene from Schizosaccharomyces pombe,
SPBC776.14 (SEQ ID NO. 3), having 46% identity over 571 amino acids, and
three Arabidopsis thaliana genomic sequences coding for putative proteins
with amino acid sequences having approximately 44% identity over a total of
378 amino acids (SEQ ID NO. 4), 47% identity over 73 amino acids (SEQ ID
NO. 9) and 33% identity over 105 amino acids (SEQ ID NO. 10) with the yeast
enzyme, were identified.

Also, a partially sequenced cDNA clone from Neurospora crassa (SEQ ID NO.
8) showing 42% identity over 167 amino acids, and a Zea mays EST clone (SEQ
ID NO. 6 and 7) showing 36% identity over 96 amino acids, with the yeast
enzyme were identified. Finally, two cDNA clones were identified: an
Arabidopsis thaliana EST (SEQ ID NO. 5), which was 98% identical in
nucleotide sequence to the Arabidopsis thaliana genomic sequence (SEQ ID
NO. 4), and a Lycopersicon esculentum EST clone (SEQ ID NO. 11), which
showed 84% identity over 72 nucleotides to SEQ ID NO. 10.
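
The identity figures quoted above (for example, 46% identity over 571 amino acids) are of the form "identical positions divided by aligned length". The following is a minimal sketch of that calculation in Python, not the procedure actually used by the inventors, which is not stated here. The function name `percent_identity`, the toy sequences and the treatment of gap columns (counted in the length, never as a match) are assumptions made for illustration only.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Return the percentage of alignment columns with identical residues.

    Both inputs must already be aligned (equal length, '-' marks a gap).
    Assumption: gap columns count toward the aligned length but never
    count as a match.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    matches = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)


# Hypothetical toy alignment (not taken from the sequence listing):
# 7 of the 10 columns are identical.
toy_identity = percent_identity("MGTLFRRNVQ", "MGSLF-RNIQ")
print(f"{toy_identity:.0f}% identity over 10 positions")
```

Different alignment programs handle gaps and the choice of aligned region differently, so a value recomputed in this way need not reproduce the percentages given in the text exactly.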

In a first embodiment, this invention is directed to a PDAT enzyme and 
nucleic acid sequences that encode a PDAT. This includes biologically 
active PDATs and encoding sequences for PDATs as well as sequences that are 
to be used as probes, vectors for transformation or cloning intermediates.
The PDAT encoding sequence may encode a complete or partial sequence 
depending upon the intended use. All or a portion of the genomic sequence, 
cDNA sequence, precursor PDAT or mature PDAT is intended. 

In a different aspect, this invention relates to a method for producing a 
PDAT in a host cell or progeny thereof, including genetically engineered 
oil seeds, yeast and moulds or any other oil accumulating organism, via the 
expression of a construct in the cell. Cells containing a PDAT as a result
of the production of the PDAT encoding sequence are also contemplated 
within the scope of the invention. 

In a different embodiment, this invention also relates to methods of using
a DNA sequence encoding a PDAT for increasing the oil-content within a 
cell. 

Another aspect of the invention relates to the accommodation of high 
amounts of uncommon fatty acids in the triacylglycerol produced within a 
cell, by introducing a DNA sequence producing a PDAT that specifically 
removes these fatty acids from the membrane lipids of the cell and channels
them into triacylglycerol. Plant cells having such a modification are also 
contemplated herein. 

A PDAT of this invention includes any sequence of amino acids, such as a
protein, polypeptide or peptide fragment obtainable from a microorganism, 
animal or plant source that demonstrates the ability to catalyse the 
production of triacylglycerol from a phospholipid and diacylglycerol under 
enzyme reactive conditions. By "enzyme reactive conditions" is meant that
any necessary conditions are available in an environment (e.g., such 
factors as temperature, pH, lack of inhibiting substances) which will 
permit the enzyme to function. 

Other PDATs are obtainable from the specific sequences provided herein.
Furthermore, it will be apparent that one can obtain natural and synthetic
PDATs, including modified amino acid sequences and starting materials for
synthetic-protein modelling, from the exemplified PDATs and from PDATs which
are obtained through the use of such exemplified sequences. Modified amino
acid sequences include sequences that have been mutated, truncated,
increased and the like, whether such sequences were partially or wholly
synthesised. Sequences that are actually purified from plant preparations,
or are identical or encode identical proteins thereto, regardless of the
method used to obtain the protein or sequence, are equally considered
naturally derived.

The nucleic acid sequences (DNA and RNA) of the present invention can be 
used to screen and recover "homologous" or "related" PDATs from a variety
of plant and microbial sources. 

Further, it is also apparent that a person skilled in the art can, with the
information provided in this application, identify a PDAT activity in any
organism, purify an enzyme with this activity and thereby identify a
"non-homologous" nucleic acid sequence encoding such an enzyme.



BRIEF DESCRIPTION OF THE FIGURES
FIG. 1.

PDAT activity in microsomal fractions of S. cerevisiae. Aliquots of
lyophilised microsomal membranes (10 nmol phosphatidylcholine) from a wild
type yeast (strain YN979) (lanes 1-3, 8), a yeast mutant (strain B10280)
disrupted in the YNR008w gene (lanes 4-6, 9), or the yeast mutant
complemented with a single copy plasmid containing the PDAT gene (lane 7)
were assayed for PDAT activity. 2 nmol
sn-1-oleoyl-sn-2-[14C]-ricinoleoylphosphatidylcholine (lanes 1-7) or
sn-1-oleoyl-sn-2-[14C]-oleoyl-phosphatidylcholine (lanes 8-9) and 5 nmol of
dioleoyl-diacylglycerol (lanes 2, 5, 7, 9) or
rac-oleoyl-vernoloyldiacylglycerol (lanes 3, 6) were added in benzene
solution. The benzene was evaporated under N2 (g) and 0.1 ml of 50
mM potassium phosphate, pH 7.2, was added. The suspension was thoroughly
mixed and after SO min at 30 °C the lipids were extracted in chloroform and
separated by thin layer chromatography on silica gel 60 plates in
hexane/diethyl ether/acetic acid (35:70:1.5). The radioactive lipids were
visualised and quantified on the plates by electronic autoradiography
(Instant Imager, Packard, US). Abbreviations used: TAG, triacylglycerol;
FA, fatty acid (i.e. oleic acid); 1-OH-TAG,
monoricinoleoyl-triacylglycerol; 1-OH-1-ep-TAG,
monoricinoleoyl-monovernoloyl-triacylglycerol; and OH-FA, ricinoleic acid.

Brief Description of the SEQ ID:

SEQ ID NO. 1: Genomic DNA sequence of the Saccharomyces cerevisiae PDAT
gene, YNR008w, GenBank nucleotide ID number 1302481, and the suggested
YNR008w amino acid sequence.

SEQ ID NO. 2: The suggested amino acid sequence of the yeast gene YNR008w
from Saccharomyces cerevisiae.

SEQ ID NO. 3: Genomic DNA sequence of the Schizosaccharomyces pombe gene
SPBC776.14.

SEQ ID NO. 4: Genomic DNA sequence of part of the Arabidopsis thaliana
locus with GenBank accession number AB006704.

SEQ ID NO. 5: Nucleotide sequence and the corresponding amino acid sequence
of the Arabidopsis thaliana EST clone with GenBank accession number
T04806 and ID number 315966.

SEQ ID NO. 6: Nucleotide and amino acid sequence of the Zea mays cDNA clone
with GenBank ID number g4388167.

SEQ ID NO. 7: Amino acid sequence of the Zea mays cDNA clone with GenBank
ID number g4388167.

SEQ ID NO. 8: DNA sequence of part of the Neurospora crassa cDNA clone
VI07G1, ID number g4241729.

SEQ ID NO. 9: Genomic DNA sequence of part of the Arabidopsis thaliana
locus with GenBank accession number AC004557.

SEQ ID NO. 10: Genomic DNA sequence of part of the Arabidopsis thaliana
locus with GenBank accession number AC003027.

SEQ ID NO. 11: DNA sequence of part of the Lycopersicon esculentum cDNA
clone with GenBank accession number AI486635.

The present invention can be essentially characterized by the following
aspects:

1. Use of a PDAT gene (genomic clone or cDNA) for transformation.

2. Use of a DNA molecule according to item 1 wherein said DNA is used for
transformation of any organism in order to be expressed in this
organism and result in an active recombinant PDAT enzyme in order to
increase the oil content of the organism.

3. Use of a DNA molecule of item 1 wherein said DNA is used for
transformation of any organism in order to prevent the accumulation of
undesirable fatty acids in the membrane lipids.

4. Use according to item 1, wherein said PDAT gene is used for
transforming transgenic oil accumulating organisms engineered to
produce any uncommon fatty acid which is harmful if present in high
amounts in membrane lipids, such as medium chain fatty acids,
hydroxylated fatty acids, epoxygenated fatty acids and acetylenic
fatty acids.

5. Use according to item 1, wherein said PDAT gene is used for 
transforming organisms, and wherein said organisms are crossed with 
other oil accumulating organisms engineered to produce any uncommon
fatty acid which is harmful if present in high amounts in membrane 
lipids, comprising medium chain fatty acids, hydroxylated fatty 
acids, epoxygenated fatty acids and acetylenic fatty acids. 

6. Use according to item 1, wherein the enzyme encoded by said PDAT gene
or cDNA is a PDAT with distinct acyl specificity.

7. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from Saccharomyces cerevisiae, or contains nucleotide
sequences coding for an amino acid sequence 30% or more identical to
the amino acid sequence of PDAT as presented in SEQ ID NO. 2.

8. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from Saccharomyces cerevisiae, or contains nucleotide
sequences coding for an amino acid sequence 40% or more identical to
the amino acid sequence of PDAT as presented in SEQ ID NO. 2.

9. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from Saccharomyces cerevisiae, or contains nucleotide sequences
coding for an amino acid sequence 60% or more identical to the amino
acid sequence of PDAT as presented in SEQ ID NO. 2.

10. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from Saccharomyces cerevisiae, or contains nucleotide sequences
coding for an amino acid sequence 80% or more identical to the amino
acid sequence of PDAT as presented in SEQ ID NO. 2.

11. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from plants or contains nucleotide sequences coding for an
amino acid sequence 40% or more identical to the amino acid sequence
encoded by the PDAT gene from Arabidopsis thaliana as presented in
SEQ ID NO. 4, 9 or 10, or to the protein encoded by the full-length
counterpart of the partial Zea mays, Lycopersicon esculentum or
Neurospora crassa cDNA clones.

12. Transgenic oil accumulating organisms comprising, in their genome, a 
PDAT gene transferred by recombinant DNA technology or somatic 
hybridization. 

13. Transgenic oil accumulating organisms according to item 12 comprising, 
in their genome, a PDAT gene having specificity for substrates with 
a particular uncommon fatty acid and the gene for said uncommon fatty
acid. 

14. Transgenic organisms according to item 12 or 13 which are selected 
from the group consisting of fungi, plants and animals. 

15. Transgenic organisms according to item 12 or 13 which are selected
from the group of agricultural plants.

16. Transgenic organisms according to item 12 or 13 which are selected
from the group of agricultural plants and where said PDAT gene is
expressed under the control of a storage organ specific promoter.

17. Transgenic organisms according to item 12 or 13 which are selected
from the group of agricultural plants and where said PDAT gene is
expressed under the control of a seed promoter.

18. Oils from organisms according to items 12 to 17.

19. A method for altering the acyl specificity of a PDAT by alteration of
the nucleotide sequence of a naturally occurring encoding gene and, as a
consequence of this alteration, creating a gene encoding an enzyme
with novel acyl specificity.

20. A protein encoded by a DNA molecule according to item 1 or a
functional fragment thereof.

21. A protein of item 20 designated phospholipid:diacylglycerol
acyltransferase.

22. A protein of item 21 which has a distinct acyl specificity.

23. A protein of item 13 having the amino acid sequence as set forth in
SEQ ID NO. 2 (and the proteins encoded by the full-length or
partial genes set forth in SEQ ID NO. 1, 3, 4, 5, 6, 7, 8, 9, 10 or
11) or an amino acid sequence with at least 30% homology to said
amino acid sequence.

24. A protein of item 23 isolated from Saccharomyces cerevisiae.

EXAMPLE 1

Determination of substrate utilisation by the acyl-CoA independent
acyltransferase in the synthesis of triacylglycerols in microsomal
preparations of developing castor bean endosperm and baker's yeast.

Microsomal membranes prepared from developing endosperm of castor bean
(Ricinus communis) catalyse the selective transfer of ricinoleoyl
(12-hydroxy-9-octadecenoyl) and vernoloyl (12,13-epoxy-9-octadecenoyl)
groups from both diacylglycerols and phosphatidylcholine into
triacylglycerols. The substrate utilisation was investigated in castor bean
microsomes by using radioactive
sn-1-oleoyl-sn-2-[14C]ricinoleoyl-diacylglycerol
(sn-2-[14C]-ricinoleoyl-diacylglycerol) or
sn-1-oleoyl-sn-2-[14C]ricinoleoyl-phosphatidylcholine
(sn-2-[14C]-ricinoleoyl-phosphatidylcholine) together with different
non-radioactive diacylglycerols (see Table 1).

The preparation of microsomal fractions of developing castor bean endosperm
and freeze drying of the microsomes were performed in a known manner. Assays
with addition of diacylglycerol and phosphatidylcholine substrates were
performed. The results are presented in Table 1 and show that if
radioactive sn-2-[14C]ricinoleoyl-diacylglycerol was used as the only added
substrate, 2.8% of the radio-labelled ricinoleoyl chains were found in
triacylglycerol with one ricinoleoyl group, 12.4% of the radioactivity was
found in triacylglycerol species with two ricinoleoyl groups and only trace
amounts were associated with triacylglycerol consisting of three
ricinoleoyl groups. If incubations with
sn-2-[14C]ricinoleoyl-diacylglycerol were performed in a 1:4 (mol:mol)
mixture with non-radioactive diacylglycerol species containing one
vernoloyl group, the distribution of radioactivity between different
molecular species of triacylglycerol changed only marginally compared to
incubations with just radioactive substrate. Only 1.3% of the added
14C-labelled ricinoleoyl groups were metabolised into triacylglycerol
species with one ricinoleoyl and one vernoloyl group. Similarly, only
marginal changes in the radioactive triacylglycerol molecular species were
seen in incubations where [14C]-ricinoleoyl-diacylglycerol was mixed with
non-labelled divernoloyl-diacylglycerol. However, on adding unlabelled
diricinoleoyl-phosphatidylcholine together with
sn-2-[14C]-ricinoleoyl-diacylglycerol, the radioactivity metabolised into
the different triacylglycerol species was substantially altered.

Only trace amounts of radioactivity were detected in triacylglycerol
species with one ricinoleoyl chain, whereas the radioactivity in
triacylglycerol with two ricinoleoyl groups was doubled as compared to
incubations with only sn-2-[14C]-ricinoleoyl-diacylglycerol added.

EXAMPLE 2

Transformation and expression of YNR008w gene in yeast

The yeast mutant (strain B10280), disrupted in the YNR008w gene, was
transformed with the single copy plasmid pFL39 having the PDAT gene
(YNR008w) under the control of the endogenous promoter region (583 bp 5'
untranslated) inserted into the cloning cassette. The transformed yeast
was pre-cultivated at 28 °C for 20 h in defined YNB medium without added
tryptophan. Cells were harvested and re-suspended, to the original volume
of the growth culture, in minimal medium (Meesters et al., 1995)
supplemented with 16 g/l glycerol. The culture was further incubated for
24 h, after which the cells were harvested by centrifugation. The
microsomal fraction of the yeast was prepared as described in Example 1
above and was incubated in the presence of
sn-2-[14C]-ricinoleoyl-phosphatidylcholine (Fig. 1, lane 7).
This experiment clearly shows that the PDAT activity could be restored by
the expression of the YNR008w gene in the mutant yeast strain B10280,
which normally lacks PDAT activity.

The effect of over-expression of the PDAT gene on lipid accumulation was
studied by transforming the wild-type yeast (strain SCY62) with a plasmid,
pJN92, containing the PDAT gene (YNR008w) under the control of a GAL1
promoter. The transformed yeast was then cultivated at 28 °C in defined
YNB medium lacking uracil. The expression of the PDAT gene was induced by
the addition of 2% (v/v) galactose after 10 hours of growth, and the
culture was further incubated for 18 hours. The yeast cells were harvested
and the lipid content of the yeast was analysed by thin layer
chromatography and gas liquid chromatography. The results are presented in
Table 2. The total lipid content in the yeast over-expressing PDAT was 1.3
fold higher than in the control yeast transformed with the empty plasmid
pJN92. The expression of the PDAT gene had no effect on the growth rate as
determined by optical density measurements. The elevated lipid content in
the yeast transformed with PDAT, as compared to the control yeast, can be
totally accounted for by an 80% increase in the triacylglycerol content.
The levels of the polar lipids and sterol esters were not significantly
affected by the over-expression of the PDAT gene. Hence, these results
clearly demonstrate the use of the PDAT gene in increasing the oil content
in transgenic organisms.

Table 2. Lipid content in yeast with overexpressed PDAT gene.

Yeast transformed with the PDAT gene under the control of a gal promoter in
a xxxx plasmid were grown in YNB medium at 28 °C. After 10 hours growth the
expression of the PDAT gene was induced by the addition of 2% (v/v)
galactose. Cells were harvested after an additional 18 hours growth and
analysed for their lipid content. Yeast transformed with an empty plasmid
grown under identical conditions was used as control. Lipids were extracted
in chloroform, fractionated on TLC and quantified by GC analyses. The lipid
content was measured as nmol fatty acids (FA) per mg yeast (dry weight).

                   PDAT-transformed yeast      Control yeast
                   nmol FA/mg     (%)          nmol FA/mg     (%)
Polar lipids            60         32               60         42
Sterol esters           17          9               20         14
Triacylglycerol        105         56               58         40
Other lipids             6          3                6          4
Sum                    188                         144
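
The arithmetic behind Table 2 and the figures quoted in Example 2 can be re-derived from the tabulated values. The following Python sketch is offered only as a consistency check. The variable and function names are illustrative assumptions, and the input numbers are the nmol FA/mg values of Table 2.

```python
# Lipid content in nmol fatty acids per mg dry weight, as given in Table 2.
pdat_yeast = {
    "Polar lipids": 60,
    "Sterol esters": 17,
    "Triacylglycerol": 105,
    "Other lipids": 6,
}
control_yeast = {
    "Polar lipids": 60,
    "Sterol esters": 20,
    "Triacylglycerol": 58,
    "Other lipids": 6,
}


def summarise(lipids):
    """Return the total lipid content and each class as a rounded percentage."""
    total = sum(lipids.values())
    shares = {name: round(100 * value / total) for name, value in lipids.items()}
    return total, shares


pdat_total, pdat_shares = summarise(pdat_yeast)
control_total, control_shares = summarise(control_yeast)

# Fold change in total lipid and relative increase in triacylglycerol.
fold_total = pdat_total / control_total
tag_increase = 100 * (
    pdat_yeast["Triacylglycerol"] / control_yeast["Triacylglycerol"] - 1
)

print(f"Totals: {pdat_total} vs {control_total} nmol FA/mg")
print(f"Total lipid fold change: {fold_total:.2f}")
print(f"Triacylglycerol increase: {tag_increase:.0f} %")
```

Under these inputs the class sums come to 188 and 144 nmol FA/mg, the ratio of the totals is about 1.3, and the triacylglycerol increase is roughly 81%, in line with the "1.3 fold" and "80 %" stated in the text.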

Claims

1. An enzyme catalysing the transfer of fatty acids from
phospholipids to diacylglycerol forming triacylglycerol.

2. A recombinant DNA molecule encoding the enzyme of claim 1.

3. A recombinant DNA molecule according to claim 2 encoding an
enzyme catalysing the transfer of fatty acids from
phospholipids to diacylglycerol in the biosynthetic pathway
for the production of triacylglycerol, wherein said enzyme
comprises an amino acid sequence set forth in SEQ ID NO. 2,
or fragments thereof, or is encoded by the genes set forth
in SEQ ID NO. 1, 3, 4, 9 or 10, or is encoded by the
full-length genes corresponding to the partial nucleotide
sequences set forth in SEQ ID NO. 5, 6, 7, 8 and 11, or
sequences 40% or more identical to said sequences.

4. The DNA molecule of claim 2 or 3, wherein said enzyme is
designated as phospholipid:diacylglycerol acyltransferase.

5. A vector comprising a DNA molecule of claim 2 or 3.

6. A vector of claim 5 further comprising a selectable marker
gene.

7. A host cell containing a DNA molecule of claim 2 or 3.

8. The host cell of claim 7 which is a plant cell or yeast
cell.

9. A process for the production of transgenic yeast cells,
plant cells or plants comprising a) transforming a DNA
molecule of claims 2 or 3 into plant cells or plants; and
b) selecting transformed plant cells or plants having an
altered biosynthetic pathway in the production of
triacylglycerol.

10. A process of claim 9 wherein the altered biosynthetic
pathway is characterised by an increased or altered oil
content.

11. A process of claim 9 wherein the altered biosynthetic
pathway is characterised by the prevention of accumulation
of undesirable fatty acids in the membrane lipids.

12. A protein encoded by a DNA molecule according to claim 2 or
3 or a functional fragment thereof.

13. A method for production of triacylglycerol, comprising
enzymatic catalysis by an enzyme catalysing the transfer
of fatty acids from phospholipids to diacylglycerol forming
triacylglycerol.

14. A method according to claim 13, comprising enzymatic
catalysis by a protein or functional fragment according
to claim 12.



The present invention relates to a novel class of enzymes catalysing
the transfer of fatty acids from phospholipids to diacylglycerol in
the biosynthetic pathway for the production of triacylglycerol.
Further, the invention relates to the isolation, identification,
characterization and use of recombinant DNA molecules encoding this
class of enzymes.



SEQUENCE LISTING
<110> Stymne Dr., Sten

<120> RECOMBINANT DNA MOLECULES ENCODING A NEW CLASS OF
ENZYMES IN THE BIOSYNTHETIC PATHWAY FOR THE PRODUCTION
OF TRIACYLGLYCEROL
<130> stymne
<140>
<141>
<160> 11

<170>

<210> 1
<211> 1986
<212> genomic DNA
<213> Saccharomyces cerevisiae
<220>
<221> CDS
<222> (1)..(1983)
<400> 1
atg ggc aca ctg ttt cga aga aat gtc cag aac caa aag agt gat tct 48
Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser
1 5 10 15

gat gaa aac aat aaa ggg ggt tct gtt cat aac aag cga gag agc aga 96
Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

aac cac att cat cat caa cag gga tta ggc cat aag aga aga agg ggt 144
Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

att agt ggc agt gca aaa aga aat gag cgt ggc aaa gat ttc gac agg 192
Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

aaa aga gac ggg aac ggt aga aaa cgt tgg aga gat tcc aga aga ctg 240
Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

att ttc att ctt ggt gca ttc tta ggt gta ctt ttg ccg ttt agc ttt 288
Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

ggc gct tat cat gtt cat aat agc gat agc gac ttg ttt gac aac ttt 336
Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

gta aat ttt gat tca ctt aaa gtg tat ttg gat gat tgg aaa gat gtt 384
Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

ctc cca caa ggt ata agt tcg ttt att gat gat att cag gct ggt aac 432
Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

tac tcc aca tct tct tta gat gat ctc agt gaa aat ttt gcc gtt ggt 480
Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

aaa caa ctc tta cgt gat tat aat atc gag gcc aaa cat cct gtt gca 528
Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175

atg gtt cct ggt gtc att tct acg gga att gaa agc tgg gga gtt att 576
Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

gga gac gat gag tgc gat agt tct gcg cat ttt cgt aaa cgg ctg tgg 624
Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

gga agt ttt tac atg ctg aga aca atg gtt atg gat aaa gtt tgt tgg 672
Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

ttg aaa cat gta atg tta gat cct gaa aca ggt ctg gac cca ccg aac 720
Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

ttt acg cta cgt gca gca cag ggc ttc gaa tca act gat tat ttc atc 768
Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

gca ggg tat tgg att tgg aac aaa gtt ttc caa aat ctg gga gta att 816
Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

ggc tat gaa ccc aat aaa atg acg agt gct gcg tat gat tgg agg ctt 864
Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

gca tat tta gat cta gaa aga cgc gat agg tac ttt acg aag cta aag 912
Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

gaa caa atc gaa ctg ttt cat caa ttg agt ggt gaa aaa gtt tgt tta 960
Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

att gga cat tct atg ggt tct cag att atc ttt tac ttt atg aaa tgg 1008
Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

gtc gag gct gaa ggc cct ctt tac ggt aat ggt ggt cgt ggc tgg gtt 1056
Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

aac gaa cac ata gat tca ttc att aat gca gca ggg acg ctt ctg ggc 1104
Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

gct cca aag gca gtt cca gct cta att agt ggt gaa atg aaa gat acc 1152
Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

act caa tta aat acg tta gcc atg tat ggt ttg gaa aag ttc ttc tca 1200
Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

aga att gag aga gta aaa atg tta caa acg tgg ggt ggt ata cca tca 1248
Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

atg cca cca aag gga gaa gag gtc att tgg ggg gat atg aag tca tct 1296
Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430

tca gag gat gca ttg aat aac aac act gac aca tac ggc aat ttc att 1344
Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

cga ttt gaa agg aat acg agc gat gct ttc aac aaa aat ttg aca atg 1392
Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

aaa gac gcc att aac atg aca tta tcg ata tca cct gaa tgg ctc caa 1440
Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480




aga aga gta cat gag cag tac tcg ttc ggc tat tcc aag aat gaa gaa 1488
Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

gag tta aga aaa aat gag cta cac cac aag cac tgg tcg aat cca atg 1536
Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

gaa gta cca ctt cca gaa gct ccc cac atg aaa atc tat tgt ata tac 1584
Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

ggg gtg aac aac cca act gaa agg gca tat gta tat aag gaa gag gat 1632
Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

gac tcc tct gct ctg aat ttg acc atc gac tac gaa agc aag caa cct 1680
Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

gta ttc ctc acc gag ggg gac gga acc gtt ccg ctc gtg gcg cat tca 1728
Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

atg tgt cac aaa tgg gcc cag ggt gct tca ccg tac aac cct gcc gga 1776
Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590

att aac gtt act att gtg gaa atg aaa cac cag cca gat cga ttt gat 1824
Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605

ata cgt ggt gga gca aaa agc gcc gaa cac gta gac atc ctc ggc agc 1872
Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

gcg gag ttg aac gat tac atc ttg aaa att gca agc ggt aat ggc gat 1920
Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

ctc gtc gag cca cgc caa ttg tct aat ttg agc cag tgg gtt tct cag 1968
Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655
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atg ccc ttc cca atg taa 1986 
Met Pro Phe Pro Met 
660 



e-99 ; 018^B^ 
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<210> 2 
<211> 661 
<212> PRT 

<213> saccharomyces cerevisiae 
<400> 2 

Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser
1 5 10 15

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

Leu Pro Glu Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175

Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430

Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590

Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605

Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655

Met Pro Phe Pro Met
660

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<210> 3 

<211> 2312
<212> genomic DNA
<213> Schizosaccharomyces pombe
<400> 3

ATGGCGTCTT CCAAGAAGAG CAAAACTCAT AAGAAAAAGA AAGAAGTCAA 50

ATCTCCTATC GACTTACCAA ATTCAAAGAA ACCAACTCGC GCTTTGAGTG 100

AGCAACCTXC AGCGTCCGAA ACACAATCTG TTTCAAATAA ATCAAGAAAA 150

TCTAAATTTG GAAAAAGATT GAATTTTATA TTGGGCGCTA TTTTGGGAAT 200

ATGCGCTGCT TTTTTTTTCG CTGTTGGAGA CGACAATGCT GTTTTCGACC 250

CTGCTACGTT AGATAAATTT GGGAATATGC TAGGCTCTTC AGACTTGXTT 300

GATGACATTA AAGGATATTT ATCTTATAAT GTGTT7AAGG ATGCACCTTT 350

TACTACGGAC AAGCCTTCGC AGTCTCCTAG CGGAAATGAA GTTCAAGTTG 400

GTCTTGATAT GTACAATCAG GGATATCGAA GTGACCATCC TGTTATTATG 450

GTTCCTGGTG TTATCAGCTC AGGATTAGAA AGTTGGTCGT TTAATAATTG 500

CTCGATTCCT TACTTTAGGA AACGTCTTTG GGGTAGCTGG TCTATGCTGA 550

AGGCAATGTT CCTTGACAAG CAATGCTGGC TTGAACATTT AATGCTTGAT 600

AAAAAAACCG GCTTGGATCC GAAGGGAATT AAGCTGCGAG CAGCTCAGGG 650

GTTTGAAGCA GCTGATTTTT TTATCACGGG CTATTGGATT TGGAGTAAAG 700

TAATTGAAAA CCTTGCTGCA ATTGGTTATG AGCCTAATAA CATGTTAAGT 750

GCTyCTTACG ATTGGCGGTT ATCATATGCA A*.TTTAGAGG AACGTGATAA 800

ATATTTTTCA AAG'XTAAAAA TGTTCATTGA GTACAGCAAC ATTGTACATA 850

AGAAAAAGGT AGTGTTGATT TCTCACTCCA TGGGTTCACA GGTTACGTAC 900

TATTTTTTTA AGTGGGTTGA AGCTGAGGGC TACGGAAATG GTGGACCGAC 950

TTGGGTTAAT GATCATATTG AAGCATTTAT AAATGTGAGT CTCGATGGTT 1000

GTTTGACTAC GTTTCTAACT TTTGAATAGA TATCGGGATC TTTGATTGGA 1050

GCACCCAAAA CAGTGGCAGC GCTTTTATCG GGTGAAATGA AAGATACAGG 1100

TATTGTAATT ACATTAAACA TGTTAATATT TAATTTTTGC TAACCGTTTT 1150

AAGCTCAATT GAATCAGTTT TCGGTCXATG GGTAAGCAAT AAATTGTTGA 1200

GATTTGTTAC TAATTTACTG TTTAGTTTGG AAAAATTTTT TTCCCGXTCT 1250

GAGGXATATT CAAAAATACA AATGTGCTCT ACTTTTTCTA ACTTTTAATA 1300

GAGAGCCATG ATGGTTCGCA CTATGGGAGG AGTTAGTTCT ATGCTTCCTA 1350

AA3GAGGCGA TGTTGTATGG GGAAATGCCA GTTGGGTAAG AAATATGTGC 1400

TGTTAATTTT TTATTAATAT TTAGGCTCCA GATGATCTTA ATCAAACAAA 1450

TTTTTCCAAT GGTGCAATTA TTCGATATAG AGAAGACATT GATAAGGACC 1500

AC'GATGAATT TGACATAGAT GATGCATTAC AATTTTTAAA AAATGTTACA 1550

GATGACGATT TTAAAGTCAT GCTAGCGAAA AATTATTCCC ACGGTCTTGC 1600

TTGGACTGAA AAAGAAGTO? TAAAAAATAA CGAAATGCCG TCTAAATGGA 1650

TAAATCCGCT AGAAGTAAGA ACASTAAAGT TACTAAATTA TACTAACCCA 1700

AATAGACTAG TCTTCCTTAT CCTCCTGATA TGAAAATTTA TTGOGTTCAC 1750

GGGGTCGGAA AACCAACTOA OAGAGGTTAT TATTATACTA ATAATCCTGA 1800

GGGCCAACCT GTCATTGATT CCTCGGTTAA TGATGGAACA AAAGTTGAAA 1850

ATGTGAGAGA ATTTATGTTT CAAACATTCT ATTAAC TGOTT TXATTAGGGT 1900

ATTGTTATGG ATGATGGTGA TGGAACTTTA CCAATATTAG CCCTTGGTTT 1950

GGTGTGGAAT AAAGTTTGGC AAACAAAAAG GTTTAATCCT GCTAATACAA 2000

GTATCACAAA TTATGAAATC AAGCATGAAC CTGCTGCGl'T TGATCTGAGA 2050

GGAGGACCTC GCTCGGCAGA ACACGTCGAT ATACTXGGAC ATTCAGAGCT 2100

AAATGTATGT TCATTTTACC TTACAAATTT CTATTACTAA CTCTXGAAAT 2150

AAGGAAATTA TTTTAAAAGT TTCATCAGGC CATGGTGACT CGGTACCAAA 2200

CCGTTATATA TCAGATATCC AGTACGGACA TAAGTTTTGT AGATTGCAAT 2250

TAACTAACTA ACCGAACAGG GAAATAATAA ATGAGATAAA TCTCGATAAA 2300

CCTAGAAATT AA 2312

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<210> 4 

<211> 3685
<212> genomic DNA
<213> Arabidopsis thaliana
<400> 4

ATGCCCCTTA TTCATCGGAA AAAGCCGACG GAGAAACCAT CGACGCCGCC 50
ATCTGAAGAG GTGGTGCACG ATGAGGATTC GCAAAAGAAA CCACACGAAT 100
CTTCCAAATC CCACCATAAG AAATCGAACG GAGGAGGGAA GTGGTCGTGC 150
ATCGATTCTT GTTGTTGGTT CATTGGGTGT GTGTGTGTAA CCTGGTGGTT 200
TCTTCTCTTC CTTTACAACG CAATGCCTGC GAGCTTCCCT CAGTATGTAA 250
CCGAGCGAAT CACGGGTCCT TTGCCTGACC CGCCCGGTGT TAAGCTCAAA 300
AAAGAAGGTC TTAAGGCGAA ACATCCTGTT GTCTTCATTC CTGGGATTGT 350
CACCGGTGGG cTCGAGcrrr GGGAAGGCAA ACAATGCGCT GATGGTTTAT 400
TTAGAAAACO TTTGTGGGGT GGAACTTTTG GTGAAGTCTA CAAAAGGTGA 450
GCTCAACAAT TCTCACTCTT CCTTTATATT GGGATTTGGA TTGGATCTGA 500
TGAGATCACG CACTTGTTGC TTCTTCAACA TCACTCAAAC TTTAATTCCA 550
TGTTTGTCTG TCTTACTCTT TACTTTTTTT TTTTTTTGAT GTGAAACGCT 600
ATTTTCTTAA GAGACTATTT CTGTATGTGT AAGGTAAGCG TTCCAAGGAC 650
GTAATTGGCT TGGACTATTT CTGTTTGATT GTTAACTTTA GGATATAAAA 700
TAGCTGCCTT GGAATTTCAA GTCATCTTAT TGCCAAATCT GTTGCTAGAC 750
ATGCCCTAGA GTCCGTTCAT AACAAGTTAC TTCCTTTACT GTCGTTGCGT 800
GTAGATTTAG CTTTGTGTAG CGTATAATGA AGTAGTGTTT TATGTTTTGT 850
TGGGAATAGA GAAGTTCTAA CTACATCTGT GGAAAGTGTG TTCAGGCTGT 900
GATAGAGGAC TGTTGCTTTA TTATTCAACT ATGTATATGT GTAATTAAAG 950
CTAGTTCCTT TITGATCTTT CAGCTCAATG TGCXTTTCTC AATTTTTTTC 1000
TCAATTTCAA AGTTTCACAT CGAGTTTATT CACATGTCTT GAATTTCGTC 1050
CATCCTCGTT CTGTTATCCA GCTTTGAACT CCTCCCGACC CTGCTATGGA 1100
TATATTAAAA AAAAAGTGTT TTGTGGGTTG CATCTTTGTT ACGATCTGCA 1150
TCTTCTTCTT TCGGCTCAGT GTTCATGTTT TTGCTATGGT AGAGATGGOC 1200
AArGTTATTG TTGATGGTAA CAGTGGTATA GTTGATAGTA TCTTAACTAA 1250
TCAATTATCT CTTTGATTCA GGCCTCTATG TTGGOTGGAA CACATGTCAC 1300
TTGACAATGA AACTGGGTTG GATCCAGCTG GTATTAGAGT TCGAGCTGTA 1350
TCAGGACTCG TGGCTGCTGA CTACTTTGCT CCTGGCTACT TTGTCTGGGC 1400
AGTGCTGATT GCTAACCTTC CACATATTGG ATATGAAGAG AAAAATATGT 1450
ACATGGCTGC ATATGACTGG CGGCTTTCGT TTCAGAACAC AGAGGTTCTT 1500
TTCTCATOGT TCTTTCTATT ATTCTGTTCC ATGTTACGTT TCTTTCTTCA 1550
TTACTTAAGG CTTAAATATG TTTCATCTTG AATTAATAGG TACGTGATCA 1600
GACVCTTAGC CGTATGAAAA GTAATATAGA GTTGATGGTT TCTACCAACG 1650
GTGGAAAAAA AGCAGTTATA CTTCCCCATT CCATGGGGGT CTTGTATTTT 1700
CTACATTTTA TGAAGTGGGT TGAGGCACCA GCTCCTCTGG GTGGCGGGGG 1750
TGGGCCACAT TGGTGTGCAA AGTATATTAA GGCGGTGATG AACATTGGTG 1800
GACCATTTCT TGGTGTTCCA AAAGCTGTTG CAGGGCTTTT CTCTGCTGAA 1850
GCAAAGGATG TTGCAGTTGC CAGGTATTGA ATATCTGCTT ATACTTTTGA 1900
TGATCAGAAC CTTGGCTTCTG GAACTCAAAG TTATTCTACT AAATATCAAT 1950
TCTAATAACA TTtSCTATATT ATCGCTGCAA CTGACATTGG TTCATTATTC 2000
TTGCi'GCTTA TGTAACTGAA ACTCTCTTGA GATTAGACAA ATGATGAATT 2050
GATAATTCTT ACGCATTGCT CTGTGATGAC CAGTTTCTTA GCTTCGACGA 2100
TAACATTTGT CATACTGTCT TTTGGAGGGC ATTGAATlTf GCTATGGAAA 2150
GCGCTGGAGC TTCCATGCTT GCATTCTTTA CCAATTAGCG TTATTCTGCT 2200
TCTTTCAATT TTCTTGTATA TGCATCTATG GTCTTTTATT TCTTCTTAAT 2250
TAAAGACTCG TTGGATTAGT TGCTCTATTA GTCACTTGGT TCCTTAATAT 2300
AGAACTTTAC TTTCTTCGAA AATTGCAGAG CGATTGCCCC AGGATTCTTA 2350
GACACCGATA TATTTAGACT TCAGACCTTG CAGCATGTAA TGAGAATGAC 2400
ACGCACATGG GACTCAACAA TGTCTATGTT ACCQAAGGGA GGTGACACGA 2450
YATGGGGCGG GCTTGATTGG TCACCGGAGA AAGGCCACAC CTGTTGTGGG 2500
AAAAAGCAAA AGAACAACGA AACTTGTGGT GAAGCACGTG AAAACGGAGT 2550
TTCCAAGAAA AGTCCTGTTA ACTATGGAAG GATGATATCT TTTGGGAAAG 2600
CTATAGATCT ACTACATTAT GTTGCTCCTA AGATGATGGC GCGTGGTGCC 2900
GCTCATTTCT CTTATGGAAT TGCTGATGAT TTGGATGACA CCAAGTATCA 2950
AGATCCCAAA TACTGGTCAA ATCCGTTAGA GACAAAGTAA GTGATTTCTT 3000
GATTCCAACT GTATCCTTCG TCCTGATGCA TTATCAGTCT TTTTGTTTTC 3050
GGTCTTGTTG GATATGGTTT TCAGCTCAAA GCTTACAAAG CTGTTTCTGA 3100
GCCTTTCTCA AAAAGGCTTG CTCAGTAATA TTGAGGTGCT AAAGTTGATA 3150
CATGTGACTC TTGCTTATAA ATCCTCCGTT TGGTTTGTTC TGCTTTTTCA 3200
GATTACCGAA TGCTCCTGAG ATGGAAATCT ACTCATTATA CGGAGTGGGG 3250
ATACCAACGG AACGAGCATA CGTATACAAG CTTAACCAGT CTCCCGACAG 3300
TTGCATCCCC TTTCAGATAT TCACTTCTGC TCACGAGGAG GACGAAGATA 3350
GCTGTCTGAA AGCAGGAGTT TACAATG'XGG ATGGGGATCA AACAGTACCC 3400
GTCCTAAGTG CCGGGTACAT GTGTGCAAAA GCGTGGCGTG GCAAGACAAG 3450
ATTCAACCCT TCCGGAATCA AGACTTATAT AT.GAGAATAC AATCACTCTC 3500
CGCCGGCTAA CCTGTTGGAA GGGCGCGGGA CGCAGAGTGG TGCCCATGTT 3550
GATATCATGG GAAACTTTGC TTTGATCGAA GATATCATGA GGGTTGCCGC 3600
CGGAGGTAAC GGGTCTGATA TAGGACATGA CCAGGTCCAC TCTGGCATAT 3650
TTGAATGGTC GGAGCGTATT GACCTGAAGC TGTGA 3685

<210> 5
<211> 402
<212> cDNA
<213> Arabidopsis thaliana
<220>
<221> CDS
<222> (120)..(402)
<400> 5

AGAAACAGCTCTTTGTCTCT CTCGACTGATCTAACAATCC CTAATCTGTGTTCTAAATTC 60

CTGGACGAGATTTGACAAAG TCCGTATAGCTTAACCTGGT TTAATTTCAAGTGACAGAT 119

ATG CCC CTT ATT CAT CGG AAA AAG CCG ACG GAG AAA CCA TCG ACG CCG 167
Met Pro Leu Ile His Arg Lys Lys Pro Thr Glu Lys Pro Ser Thr Pro

CCA TCT GAA GAG GTG GTG CAC GAT GAG GAT TCG CAA AAG AAA CCA CAC 215
Pro Ser Glu Glu Val Val His Asp Glu Asp Ser Gln Lys Lys Pro His

GAA TCT TCC AAA TCC CAC CAT AAG NAA TCG AAC GGA GGA GGG AAG TGG 263
Glu Ser Ser Lys Ser His His Lys ??? Ser Asn Gly Gly Gly Lys Trp

TCG TGC ATC GAT TCT TGT TGT TGG TTC ATT GGG TGT GTG TGT GTA ACC 311
Ser Cys Ile Asp Ser Cys Cys Trp Phe Ile Gly Cys Val Cys Val Thr

TGG TGG TTT CTT CTC TTC CTT TAC AAC GCA ATG CCT GCG AGC TTC CCT 359
Trp Trp Phe Leu Leu Phe Leu Tyr Asn Ala Met Pro Ala Ser Phe Pro

CAG TAT GTA ACG GAG CCG AAT CAC GNG TCC TTT GCC TTA CCC G 402
Gln Tyr Val Thr Glu Pro Asn His ??? Ser Phe Ala Leu Pro

<210> 6
<211> 643
<212> cDNA
<220>
<221> CDS
<222> (1)..(402)
<400> 6

CGG GAG AAA ATA GCT GCT TTG AAG GGG GGT GTT TAG TTA GCC GAT GGT 48
Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly
1 5 10 15

GAT GAA ACT GTT CCA GTT CTT AGT GCG GGC TAG ATC TGT GCG AAA GGA 96
Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly
20 25 30

TGG CGT GGC AAA ACT CGT TTC AGC CCT GCC GGC AGC AAG ACT TAC GTG 144
Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val
35 40 45

AGA GAA TAC AGC CAT TCG CCA CCC TCT ACT CTC CTG GAA GGC AGG GGC 192
Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly
50 55 60

ACC CAG AGC GGT GCA CAT GTT GAT ATA ATG GGG AAC TTT GCT CTA ATT 240
Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile
65 70 75 80

GAG GAC GTC ATC AGA ATA GCT GCT GGG GCA ACC GGT GAG GAA ATT GGT 288
Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly
85 90 95

GGC GAT CAG GTT TAT TCA GAT ATA TTC AAG TGG TCA GAG AAA ATC AAA 336
Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys
100 105 110

TTG AAA TTG TAA CCTATGGGAA GTTAAAGAAG TGCCGACCCG TTTATTGCGT TCC 391
Leu Lys Leu
115

AAAGTGTCCT GCCTGAGTGC AACTCTGGAT TTTGCTTAAA TATTGTAATT TTTCACGC 449

TTCATTCGTC CCTTTGTCAA ATTTACATTT GACAGGACGC CAATGCGATA CGATGTTG 507

TACCGCTATT TTCAGCATTG TATATTAAAC TGTACAGGTG TAAGTTGCAT TTGCCAGC 565

TGAAATTGTG TAGTCGTTTT CTTTACGATT TAATANCAAG TGGCGGAGCA GTGCCCCA 623

AGCNAfrAAAA AAAAAAAAAA 643
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<210> 7 

<211> 115
<212> PRT
<213> Zea mays
<400> 7

Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly
1 5 10 15

Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly
20 25 30

Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val
35 40 45

Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly
50 55 60

Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile
65 70 75 80

Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly
85 90 95

Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys
100 105 110

Leu Lys Leu
115

<210> 8
<211> 616
<212> cDNA
<213> Neurospora crassa
<400> 8



GGTGGCGAAG ACGANGGCGG AAGTTGGAGG CTAACGAGAA TGACNCTCGG 50
AGATGGATCT ACCCTCTAGA GACACGACTA CCNTTGCACC CAGCCTCAAG 100
GTNTACNGTT TNTATGGGTA GGAAGCCGAC GGAGCGAGCC TACATCTATC 150
TGOCGCCCGA TCCCGGGACG ACAACGCATC TTTAGATGAC GATCGATACG 200
ACTTTGACTW AGGGGCACAT TGACCACGGT GTGATTTTGG GCGAAGGCGA 250
TGGCACAGTG AACCTTATGA GTTTGGGGTA CCTGTGCAAT AAGGGGTGGA 300
AAATGAAGAG ATACAATCCT GCGGGCTCAA AAATAACCGT GGTCGAGATG 350
CCGCATGAAC CAGAACGG1T CAATCCGAGA GGAGGGCCGA ATACGGCGGA 500
CTTAAATATG TAGAAAAGGT XGAAATTTAT GAAGAGTAAT TAAATACGGC 550
ACATAGGTTA CTCAATAGTA TGACTAATTA AAAAAAAATT TTTTTTCTAA 600
AAAAAAA^AA AAAAAA 616
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<210> 9 

<211> 1562
<212> genomic DNA
<213> Arabidopsis thaliana
<400> 9

ATGAAAAAAA TATCTTCACA TTATTCGGTA GTCATAGCGA TACTCGTTGT 50

GGTGACGATG ACCTCGATGT GTCAAGCTGT GGGTAGCAAC GTGTACCCTT 100

TGATTCTOGT TCCAGGAAAC GGAGGTAACC AGCTAGAGGT ACGGCTGGAC 150

AGAGAATACA AGCCAAGTAG TGTCTGGTGT AGCAGCTGGT TATATCCGAT 200

TCATAAGAAG AGTGGTGGAT GGTTTAGGCT ATGGTTCGAT GCAGCAGTGT 250

TATTGTCTCC CTTCACCAGG TGCTTCAGCG ATCGAATGAT GTTGTACTAT 300

GACCCTGATT TGGATGATTA CCAAAATGCT CCTGGTGTCC AAACCCGGGT 350

TCCTCATTTC GGTTCGACCA AATCACTTCT ATACCTCGAC CCTCGTCTCC 400

GGTTAGTACT TTCCAAGATA TATCATTTTG GGACATTTGC ATAATGAACA 450

AAATAGACAT AAATTTGGGG GATTATTGTT ATATCAATAT CCATTTATAT 500

GCTAGTCGGT AATGTGAGTG TTATGTTAGT ATAGTTAATG TGAGTGTTAT 550

GTGATTTTCC ATTTTAAATG AAGCTAGAAA GTTGTCGTTT AATAATGTTG 600

CTATGTCATG AGAATTATAA GGACACTATG TAAATGTAGC TTAATAATAA 650

GGTTTGATTT GCAGAGATGC CACATCTTAC ATGGAACATT TGGTGAAAGC 700

TCTAGAGAAA AAATGCGGGT ATGTTAACGA CCAAACCATC CTAGGAGCTC 750

CATATGATTT CAGGTACGGC CTGGCTGCTT CGGGCCACCC GTCCCGTGTA 800

GCCTCACAGT TCCTACAAGA CCTCAAACAA TTGGTGGAAA AAACTAGCAG 850

CGAGAACGAA GGAAAGCCAG TGATACTCCT CTCCCATAGC CTAGGAGGAC 900

TTTTCGTCCT CCATTTCCTC AACCGTACCA CCCCTTCATG GCGCCGCAAG 950

TACATCAAAC ACTTTGTTGC ACTCGCTGCG CCATGGGGTG GGACGATCTC 1000

TCAGATGAAG ACATTTGCTT CTGGCAACAC ACTCGGTGTC CCTTTAOTTA 1050

ACCCTTTGCT GGTCAGACGG CATCAGAGGA CCTCCGAGAG TAACCAATGG 1100

CTACTTCCAT CTACCAAAGT GTTTCACGAC AGAACTAAAC CGCTTGTCGT 1150

AACTCCCCAG GTTAACTACA CAGCTTACGA GATGGATCGG TTTTTTGCAG 1200

ACATTGGATT CTCACAAGGA GTTGTGCCTT ACAAGACAAG AGTGTTGCCT 1250

TTAACAGAGG AGCTGATGAC TCCGGGAGTG CCAGTCACTT GCATATATGG 1300

GAGAGGAGTT GATACACCGG AGGTTTTGAT GTATGGAAAA GGAGGATTCG 1350

ATAAGCAACC AGAGATTAAG TATGGAGATG GAGATGGGAC GGTTAATTTG 1400

GCGAGCTTAG CAGCTTTGAA AGTCGATAGC TTGAACACCG TAGAGATTGA 1450

TGGAGTTTCG CATACATCTA TACTTAAAGA CGAGATCGCA CTTAAAGAGA 1500

TTATGAAGCA GATTTCAATT ATTAATTATG AATTAGCCAA TGTTAATGCC 1550

GTCAATGAAT GA 1562
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<210> 10 

<211> 3896
<212> genomic DNA
<213> Arabidopsis thaliana
<400> 10

ATGGGAGCGA ATTOGAAATC AGTAACGGCT TCCTTCACCG TCATCGCCGT 50
TTTTTTCTTG ATTTGCGGTG GCCGAACTGC GGTGGAGGAT GAGACCGAGT 100
TTCACGCCGA CTACTCGAAG CTATCGGGTA TAATCATTCC GGGATTTGCG 150
TCGACGCAGC TACGAGCGTG GTCGATCCTT GACTGTCCAT ACACTCCGTT 200
GGACTTCAAT CCGCTCGACC TCGTATGGCT AGACACCACT AAGGTCCGTG 250
ATCTTCATTT CCTTCGCTCC TTATTCTG'XC GGTCGAGTCA CTTGTTGATG 300
AATTCCAAGC GAAATATAGC AATGAAGCAT GTCTCGTCTC TCTTATTGAT 350
TCGTTCATTA GTCAACAGTG ACGCTTCTGA ATCTGAGTTT AGAGTCATAT 400
AAAACAGCTG ACTCGGCGAG TGTTTCCCAT CGCTTTTGGT TCGCTAAATG 450
TAGCGCAATG AATGTGTAAT TAGTCTGCGG TTTTTATTCA ACTAGATCTG 500
CAAGTTTTTC AGAGTGCTCA ATAGTAGTTA GAAAATGTTA GGTCATTTTA 550
CTTGTGCATT GTGATTCTTX TGGTTGTTGC TTACTGATCG ACGTGATGOA 600
TGGTTTACAG CTTCTTTCVG CTGTCAACTG CTGGTTTAAG TGTATGGTGC 650
TAGATCCTTA TAATCAAACA GACCATCCCG AGTGTAAGTC ACGGCCTGAC 700
AGTGGTCTTT CAGCCATCAC AGAATTGGAT CCAGGTTACA TAACAGGTAG 750
rrrcGGATTT TTCTTTCTTT TGAGTTTTCT TCAATTTGAT ATCATCTTGT 800
TGTGATATAA TATGGCTAAG TTCATTAATT TGGTCAATTT TCAGGTCCTC 850
Tl'TCTACTGT CTCGAAAGAG TOOCTTAAGT GGTGTGTXGA GTTTGGTATA 900
GAAGCAAATG CAATTGTCGC TGTTCCATAC GATTGGAGAT TGTCACCAAC 950
CAAATTGGAA GAGCGTGACC TTTACTTTCA CAAGCTCAAG TTAGTCCTTA 1000
TCAGCCTAAT GTCTTXTATC TTCTCTXXTT ATGTAAGATA AGCTAAGAGC 1050
TCTGGTCGTC TTCCTTTTTG CAGGTTGACC TTTGAAACTG CTTTAAAACT 1100
CCGTGGCGGC CCTTCTATAG TATTTGCCCA TTCAATGGGT AATAATGTCT 1150
TCAGATACTT TCTGGAATGG CTGAGGCTAG AAATTGCACC AAAACATTAT 1200
TXGAAGTGGC TTGATCAGCA TATCCATGCT TATTTCGCTG TTGGTACCGG 1250
CCTACTATCC TTAAGTTACC ATTTTATTTT TTCTCTAATT GGGGGAGTTA 1300
TGTXGTGACT TACTGGATTG AGCTCGATAC CTGATTTGTT GTTGATTTAG 1350
GAGCTCCTCT TCTTGGTTCT GTTGAGGCAA TCAAATCTAC TCTCTCTGGT 1400
GTAAOGTTTC GCCTTCCTGT TTCTGAGGTG ACCTCTGACT TCTCTTTAGT 1450
TTTAAGTAGT TGATATCAAC CAGGTCTTAT AACTCACTGG ATTTTCCTTT 1500
TGAAAGTATT ACTTTTGTTA ATTGAACTGC TGTACGCGAT ATGGTATCTG 1550
TAGATCTXGA AGTGCTAGTT ATCAAAGAAC ATATTGTGGG TAGTATACCT 1600
GTCAGCGGCC TTAGCTAATA CAACCAAACC ACATGTACAC TGATTTAGTT 1650
TTCAGATTAT TATGGTAGAC TTTAAGTTGA GAAGAAACTT TGACTGAAAT 1700
CTTTTTATTT TAATAGGCTA TGATTTGTTT ATTGAAATCA TGTGACATAT 1750
TGACATGCGC TTCTCATGTT TTTTGTTGCC AAGGCTTCAG GGAACTGCTC 1800
GGTTGTTGTC CAATTCTTTT GCGTCGTCAT TGTGGCTTAT GCOATTTTCA 1850
AAGAATTGCA AGGGTGATAA CACATTCTGG ACGCATTTTT CTGGGGGTGC 1900
TGCAAAGAAA GAXAAGCGCG TATACCACTG TGATGAAGAG GAATATCAAT 1950
CAAAATATTC TGGCTGGCCG ACAAATATTA TTAACATTGA AATTCCTTCC 2000
ACTAGCGGTT AGACTCTGTA TATGCAACTG TAACACTAAC AAAAGTTTCA 2050
CCAAGAATGT TCACTCTCAT ATTTCGTTCC TXTGATGTGT ATCCATCAGT 2100
TACAGAAACA GCTCTAGTCA ACATGACCAG CATGGAATGT GGCCTTOCCA 2150
CCCTTTTGTC TTTCACAGCC CGTGAACTAG CAGATGGGAC TCTTTTCAAA 2200
GCAATAGAAG ACTATGACCC AGATAGCAAG AGGATGTTAC ACCAGTTAAA 2250
GAAGTACGTA CGTTTCTTTG TOATAAGAAA TATXGCTCAT CGATCATCAC 2300
TTGCTGGCTT CTTGTACGTC AAATTGTTTT GTTTAAATCT CTATATCAAT 2350
TGTTCATATG CTTTGTCTTT CTTACTATAA GAAACAAGTA TAATCAGAAA 2400
CCTTAXTATT GATTATCAGT TCTCTCCTTA TATTATGGAA TGTCTTTTTC 2450
GTTTACAGTT ATGAATGCAA AAGGGGGTAT TTTAGTTGAT TGATTCTCTC 2500
ATTCTCTAGT TTGTTTTGAC TAATAGCGTC AATTTTGTTX TTCTAGCAAA 2550
TCTTTGTGAA TTATATATAA CATGCTAACT ATACTTTTCA GGTTGTATCA 2600
TGATGACCCT GTTTTTAATC CTCTGACTCC TTGGGAGAGA CCACCTATAA 2650
AAAATGTATT TTGCATATAT GGTGCTCATC TAAAGACAGA GGTATGATGC 2700
ATTCTCAATA TCACATTATG CGTTGACTXT GTTATTATAT TCCCCATTTG 2750
GTTTGCAATA TCTTTTTGAA TTATGATTTA TCTTCTCCCT TGCATCTTAT 2800
GCTATTAAGC GTTAAACGTA CTAAATGTAT GAAGCTGTCT
GGTTATTALTl' Ti'tsUCCCAAU 'ivy CAftHUL' r 'X /VrUUM'tt/Vl *k
CACGGATATC ATTTATGAAA CTGAAGGTTC CCTCGTGTCA GTCATAGGTT 2950
CCGCAATGGC AGAAGTAAAA CAGGAAGGCA AAGTCTTCTG
GTGGCATGTT ATCTCAGTTG CATAAGCAAA TTATTAAACA
AAGTACTTTT TTATCATTCC TTTTGAGCTT AGTGGATGAT
AAGTGGGAAG AGGTGTTGCA TGAAACATGA CACTTGTATC AAAGATAACT 3150
AGCAAAACAA AACTAACCCA TTTCTGAATT TCATATTATT AGGAGTAGTC 3200
GTGCTTTTAA AAAATTTOTT TTAAGAAACC GAAAAACTAG TTCATATCTT 3250
GATTOTGCAA TATCTGCAGG TCTGGAACTG TGGTTGATGG GAACGCTGGA 3300
CCTATAACTG GGGATGAGAC GGTAAGCTCA GAAGTTOGTT TTGAAATTAT 3350
CTTCTTG CAA ACTACTGAAG ACTAAGATAA TACTTGCTTC TGGAACACTG 3400
CTTGCTATGT TCTCTAGTAC ACTGCAATAT TGACTCTCCG CTACTTTTAT 3450
TGATTATGAA ATTGATCTCT TATAGGTACC CTATCATTCA CXCTCTTGGT 3500
CCAAGAATTG GCTCGGACCT AAAGTTAACA TAACAATGGC TCCCCAGGTA 3550
CTCTTTTTTA GTTCCTCACC TTATATAGAT CAAACTTTAA GTGTACTTTT 3600
CTGOTTATGT GTTGATTTAC CTCCAMTTG TTCTTTCTAA AftATCATATA 3650
TCTCTGTACT CCTCAfcGAAC TTGTATTAAT CTAAAOGAGA TTCTCATTGC 3700
GAAAATAAAA CAACAGCCAG AACACGATGG AAGCGACGTA CATGTGGAAC 3750
TAAATGTTGA TCATGAGCAT GGGTCAGACA TCATAGCTAA CATGACAAAA 3800
GCACCAAGGG TTAAGTACAT AACCTTTTAT GAAGACTCTG AGAGCATTCC 3850
GGGGAAGAGA ACCGCAGTCT GGGAGCTTGA TAAAAGTGGG XATTAA 3896



<210> 11 
<211> 709
<212> cDNA
<213> tomato
<400> 11

CTGGGGCCAA AAGTGAACAT AACAAGGACA CCACAGTCAG AGCATGATGT 50

TCAGATGTAC AAGTGCATCT AAATATAGAG CATCAACATG GTGAAGATAT 100

CATTCCCAAT ATGACAAAGT TACCTACAAT GAAGTACATA ACCTATTATG 150

AGGATTCTGA AAGTTTTCCA GGGACAAGAA CAGCAGTXTG GGAGCTTGAT 200

AAAGCAAATC ACAGGAACAT TGTCAGATCT CCAGCTTTGA TGCGGGAGCT 250

GTGGCTTGAG ATGTCGCATG ATATTCATCC TGATAAAAAG TCCAAGTTTG 300

TTACAAAAGG TGGTGTCTGA TCCTCACTAT TTTCTTCTAT AAATGTTTGA 350

GXTTGTATTG ACATTGTAAG TATTGCAACA AAAAGCAAAG CGTGGGCCTC 400

TGAGGGATGA GGACTGCTAT TGGGATTACG GGAAAGCXCG ATGTGCATGG 450

GCTGAACATT GTGAATACAG GTTAGAATAT TCAAATTATA TTTTGCAAAA 500

TATTCTCTTT TTGTGTATTT AGGCCACCTT TCCCCGGTCA CAACGATGCA 550

GATATGTATT CGGGGATGTT CACCTGGGAC AGAGTTGCAG ATTGAAGAGT 600

TCTACATCTC ACATCCTGTC ACACTATGTG TGATATTTAA GAAACTTTGT 650

TTGGOGGAAC AACAAGTTTG CACAAACATT TGAAGAAGAA AGCGAAATGA 700

TTCAGAGAG 709
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